Abstract

For the random eigenvalues with density corresponding to the Jacobi ensemble

$$ c \cdot \prod_{i<j} |\lambda_i - \lambda_j|^{\beta} \prod_{i=1}^{n} (2 - \lambda_i)^{a} (2 + \lambda_i)^{b} I_{(-2,2)}(\lambda_i) $$

($a, b > -1$, $\beta > 0$) a strong uniform approximation by the roots of the Jacobi polynomials
is derived if the parameters $a$, $b$, $\beta$ depend on $n$ and $n \to \infty$. Roughly speaking, the
eigenvalues can be uniformly approximated by roots of Jacobi polynomials with parameters
$((2a+2)/\beta - 1, (2b+2)/\beta - 1)$, where the error is of order $\{\log n/(a+b)\}^{1/4}$. These results
are used to investigate the asymptotic properties of the corresponding spectral distribution
if $n \to \infty$ and the parameters $a$, $b$ and $\beta$ vary with $n$. We also discuss further applications
in the context of multivariate random $F$-matrices.
1 Introduction

The three classical ensembles of random matrix theory are the Hermite, Laguerre and Jacobi
ensembles. The Hermite or Gaussian ensembles arise in physics and are obtained as the eigenvalue
distribution of a symmetric matrix with Gaussian entries. The Laguerre or Wishart ensembles
appear in statistics and correspond to the distribution of the singular values of a Gaussian matrix.
Similarly, the Jacobi ensembles are objects of statistical interest and are motivated by multivariate
analysis of variance (MANOVA; see Muirhead (1982)). Associated with each ensemble there
is a real positive parameter $\beta$ which is usually considered for three values. The case $\beta = 1$
corresponds to real matrices, while the ensembles for $\beta = 2$ and $\beta = 4$ arise from complex and
quaternion random matrices, respectively, according to Dyson's (1962) threefold classification.
Dyson also observed that the eigenvalue distributions correspond to the Gibbs distribution for
the classical Coulomb gas at three different temperatures. In other words, starting from the
physical interpretation, for many decades it was only known that there exist random matrix
models for the Coulomb gas at three different temperatures. Recently Dumitriu and Edelman
(2002) provided tridiagonal random matrix models for the $\beta$-Hermite and $\beta$-Laguerre ensembles
for all $\beta > 0$. Dette and Imhof (2007) derived strong uniform approximations of the random
eigenvalues of these ensembles by the (deterministic) roots of Hermite and Laguerre polynomials.
The development of a matrix model corresponding to the Jacobi ensemble was an open problem,
which was recently considered by Lippert (2003), Killip and Nenciu (2004) and Edelman and
Sutton (2006). In particular, Killip and Nenciu (2004) found a tridiagonal matrix model for the
$\beta$-Jacobi ensemble with density

$$ f(\lambda) = c \cdot \prod_{i<j} |\lambda_i - \lambda_j|^{\beta} \prod_{i=1}^{n} (2 - \lambda_i)^{a} (2 + \lambda_i)^{b} I_{(-2,2)}(\lambda_i) \tag{1.1} $$

($a, b > -1$, $\beta > 0$), where the entries are simple functions of independent random variables with
a beta distribution on the interval $[-1,1]$ [see equations (2.2) and (2.3) in Section 2 for more
details]. It is the purpose of the present paper to obtain further insight into the stochastic properties
of the random eigenvalues with density given by (1.1). In Section 2 we introduce the tridiagonal
matrix proposed by Killip and Nenciu (2004) and present a uniform approximation of the random
eigenvalues with density (1.1) by roots of Jacobi polynomials. Roughly speaking, if $n \to \infty$,
the random eigenvalues can be uniformly approximated by roots of the Jacobi polynomials
$P_n^{((2a+2)/\beta - 1,\,(2b+2)/\beta - 1)}(x/2)$, where the error of this approximation is of order $\{\log n/(a+b)\}^{1/4}$.
These results are used in Section 3 to investigate the asymptotic properties of the spectrum if
the parameters $a$, $b$ and $\beta$ vary simultaneously with $n$ and $n \to \infty$. In Section 4 we study as an
application the eigenvalue distribution of a multivariate $F$-matrix, which was also investigated
by Silverstein (1985b) and Collins (2005). We present several extensions of these results. In
particular, we consider almost sure convergence [Silverstein (1985b) discussed convergence in
probability while Collins (2005) considered the expectation of the empirical spectral distribution]
and investigate the case where the parameters and the sample size converge to infinity with
different order [Silverstein (1985b) and Collins (2005) discussed the case where $a \sim \gamma_1 n$, $b \sim \gamma_2 n$].
Finally, some technical results are given in an appendix.
2 Strong approximation of the Jacobi ensemble

Recall the definition of the Jacobi ensemble in (1.1) and, for $p, q > 0$, let $\alpha \sim B(p,q)$ denote a
Beta-distributed random variable on the interval $[-1,1]$ with density

$$ f(x) = 2^{1-p-q} \frac{\Gamma(p+q)}{\Gamma(p)\Gamma(q)} (1-x)^{p-1} (1+x)^{q-1} I_{(-1,1)}(x). \tag{2.1} $$

It was shown by Killip and Nenciu (2004) that if $\alpha_0, \alpha_1, \dots, \alpha_{2n-2}$ are independent random
variables with distribution

$$ \alpha_k \sim \begin{cases} B\big(\tfrac{2n-k-2}{4}\beta + a + 1,\ \tfrac{2n-k-2}{4}\beta + b + 1\big) & \text{if } k \text{ even}, \\ B\big(\tfrac{2n-k-3}{4}\beta + a + b + 2,\ \tfrac{2n-k-1}{4}\beta\big) & \text{if } k \text{ odd}, \end{cases} \tag{2.2} $$

then the joint density of the (real) eigenvalues of the tridiagonal matrix

$$ J := \begin{pmatrix} b_1 & a_1 & & \\ a_1 & b_2 & \ddots & \\ & \ddots & \ddots & a_{n-1} \\ & & a_{n-1} & b_n \end{pmatrix} \tag{2.3} $$

with entries

$$ b_{k+1} = (1 - \alpha_{2k-1})\alpha_{2k} - (1 + \alpha_{2k-1})\alpha_{2k-2}, $$
$$ a_{k+1} = \{(1 - \alpha_{2k-1})(1 - \alpha_{2k}^2)(1 + \alpha_{2k+1})\}^{1/2} $$

($\alpha_{2n-1} = \alpha_{-1} = \alpha_{-2} = -1$) is given by the Jacobi ensemble (1.1). In the following discussion we
consider the asymptotic properties of the eigenvalues of the random matrix (2.3) [or equivalently
of the Jacobi ensemble (1.1)], where $n \to \infty$ and the parameters $a$, $b$ and $\beta$ in (1.1) also vary
with $n$. An important tool in our analysis is the family of Jacobi polynomials $P_n^{(\gamma,\delta)}(x)$, which are
defined as the polynomials of degree $n$ with leading coefficient $(n + \gamma + \delta + 1)_n / (2^n n!)$ satisfying
the orthogonality relation

$$ \int_{-1}^{1} P_n^{(\gamma,\delta)}(x) P_m^{(\gamma,\delta)}(x) (1-x)^{\gamma} (1+x)^{\delta}\, dx = 0 \quad \text{if } m \neq n \tag{2.4} $$

[see Szegő (1975)]. Here and throughout this paper $(a)_n = \Gamma(a+n)/\Gamma(a)$ denotes the Pochhammer
symbol. The following results provide an almost sure uniform approximation of the random
eigenvalues of the Jacobi ensemble by the roots of orthogonal polynomials, if $n \to \infty$. We begin
with a statement of an exponential bound for the probability of a maximal deviation between
the roots of the Jacobi polynomials $P_n^{((2a+2)/\beta - 1,\,(2b+2)/\beta - 1)}(x/2)$ and the random eigenvalues of the
Jacobi ensemble.
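The construction (2.2)-(2.3) can be sketched in a few lines of code. The following is a minimal illustration, not the authors' implementation: the function names and the seeding are hypothetical, and it relies on the fact that a variable with density (2.1) on $[-1,1]$ can be written as $1 - 2Z$ with $Z$ Beta$(p,q)$-distributed on $[0,1]$. It only returns the diagonal and off-diagonal entries of $J$; computing eigenvalues would require a linear algebra library.

```python
import random

def sample_alphas(n, a, b, beta, rng):
    """Draw alpha_0, ..., alpha_{2n-2} as in (2.2); alpha_{-1} = alpha_{-2} = -1."""
    alphas = {-2: -1.0, -1: -1.0}
    for k in range(2 * n - 1):
        if k % 2 == 0:
            p = (2 * n - k - 2) / 4 * beta + a + 1
            q = (2 * n - k - 2) / 4 * beta + b + 1
        else:
            p = (2 * n - k - 3) / 4 * beta + a + b + 2
            q = (2 * n - k - 1) / 4 * beta
        # B(p, q) on [-1, 1] with density (2.1) is 1 - 2 * Beta(p, q) on [0, 1].
        alphas[k] = 1.0 - 2.0 * rng.betavariate(p, q)
    return alphas

def tridiagonal_entries(n, a, b, beta, seed=0):
    """Return (diagonal b_1..b_n, off-diagonal a_1..a_{n-1}) of the matrix J in (2.3)."""
    rng = random.Random(seed)  # hypothetical seeding, only for reproducibility
    al = sample_alphas(n, a, b, beta, rng)
    diag = [(1 - al[2 * k - 1]) * al[2 * k] - (1 + al[2 * k - 1]) * al[2 * k - 2]
            for k in range(n)]
    off = [((1 - al[2 * k - 1]) * (1 - al[2 * k] ** 2) * (1 + al[2 * k + 1])) ** 0.5
           for k in range(n - 1)]
    return diag, off

if __name__ == "__main__":
    d, o = tridiagonal_entries(n=6, a=1.0, b=2.0, beta=2.0, seed=1)
    print(d)
    print(o)
```

Since every $\alpha_k$ lies in $[-1,1]$, each diagonal entry is bounded by 2 in absolute value and each off-diagonal entry is non-negative, which gives a quick sanity check on any sampler of this kind.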
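Theorem 2.1. Let $\lambda_1^{(n)} \le \dots \le \lambda_n^{(n)}$ denote the ordered eigenvalues with density given by the
Jacobi ensemble (1.1) and $x_1^{(n)} < \dots < x_n^{(n)}$ denote the ordered roots of the Jacobi polynomial

$$ P_n^{(\frac{2a+2}{\beta} - 1,\ \frac{2b+2}{\beta} - 1)}(x/2), $$

then the following inequality holds for any $\varepsilon \in (0,1]$:

$$ P\Big( \max_{1 \le j \le n} |\lambda_j^{(n)} - x_j^{(n)}| > \varepsilon \Big) \le 4(2n-1) \exp\Big\{ \Big( \log\Big(1 + \frac{\varepsilon^2}{648 + 2\varepsilon^2}\Big) - \frac{\varepsilon^2}{648 + 2\varepsilon^2} \Big)(a + b + 2) \Big\}. \tag{2.5} $$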
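To get a feeling for the size of the right-hand side of (2.5), the bound can be evaluated directly. The sketch below is only an illustration; the function name `approximation_bound` is hypothetical and the formula is the one stated in Theorem 2.1.

```python
import math

def approximation_bound(n, a, b, eps):
    """Right-hand side of (2.5) for eps in (0, 1]."""
    if not 0 < eps <= 1:
        raise ValueError("eps must lie in (0, 1]")
    t = eps ** 2 / (648 + 2 * eps ** 2)
    c = math.log1p(t) - t  # log(1 + t) - t, which is negative for t > 0
    return 4 * (2 * n - 1) * math.exp(c * (a + b + 2))

if __name__ == "__main__":
    for total in (1e2, 1e6, 1e8):
        print(total, approximation_bound(n=100, a=total / 2, b=total / 2, eps=1.0))
```

Because the exponent is of order $-\varepsilon^4 (a+b)$ with a very small constant, the bound only becomes informative when $a + b$ is large, in line with condition (2.25) of Theorem 2.2.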



Proof of Theorem 2.1. Interchanging the rows and columns of the matrix $J$ defined in (2.3)
it follows that the matrix

$$ J' := \begin{pmatrix} b'_1 & a'_1 & & \\ a'_1 & b'_2 & \ddots & \\ & \ddots & \ddots & a'_{n-1} \\ & & a'_{n-1} & b'_n \end{pmatrix} \tag{2.6} $$

with entries

$$ b'_{k+1} = b_{n-k} = (1 - \alpha_{2n-2k-3})\alpha_{2n-2k-2} - (1 + \alpha_{2n-2k-3})\alpha_{2n-2k-4}, \tag{2.7} $$
$$ a'_{k+1} = a_{n-k-1} = \{(1 - \alpha_{2n-2k-5})(1 - \alpha_{2n-2k-4}^2)(1 + \alpha_{2n-2k-3})\}^{1/2} \tag{2.8} $$

has the same eigenvalues as the matrix $J$. This implies that the joint density of the eigenvalues
of the matrix $J'$ is also given by the $\beta$-Jacobi ensemble defined in (1.1). We now consider the
deterministic matrix

$$ D := \begin{pmatrix} d_1 & c_1 & & \\ c_1 & d_2 & \ddots & \\ & \ddots & \ddots & c_{n-1} \\ & & c_{n-1} & d_n \end{pmatrix} \tag{2.9} $$

where we essentially replace the random variables in (2.7) and (2.8) by their corresponding
expectations, that is

$$ d_{k+1} := (1 - E[\alpha_{2n-2k-3}]) E[\alpha_{2n-2k-2}] - (1 + E[\alpha_{2n-2k-3}]) E[\alpha_{2n-2k-4}], $$
$$ c_{k+1} := \{(1 - E[\alpha_{2n-2k-5}])(1 - E[\alpha_{2n-2k-4}]^2)(1 + E[\alpha_{2n-2k-3}])\}^{1/2}. $$

A straightforward calculation [observing that the expectation of a random variable with density
(2.1) is given by $(q - p)/(p + q)$] yields

$$ d_{k+1} = \frac{2(\bar b^2 - \bar a^2)}{(2k + \bar a + \bar b)(2k + \bar a + \bar b + 2)}, \quad k = 0, \dots, n-2; \tag{2.10} $$

$$ c_{k+1} = \frac{4}{2k + \bar a + \bar b + 2} \Big\{ \frac{(k + \bar a + \bar b + 1)(k + \bar a + 1)(k + \bar b + 1)(k + 1)}{(2k + \bar a + \bar b + 3)(2k + \bar a + \bar b + 1)} \Big\}^{1/2} \tag{2.11} $$

($k = 0, \dots, n-3$), where $\bar a = \frac{2a+2}{\beta}$, $\bar b = \frac{2b+2}{\beta}$, and (observing that $\alpha_{-1} = \alpha_{-2} = -1$)

$$ d_n = \frac{2(\bar b - \bar a)}{2n + \bar a + \bar b - 2}, \tag{2.12} $$

$$ c_{n-1} = \frac{4}{2n + \bar a + \bar b - 2} \Big\{ \frac{(n + \bar a - 1)(n + \bar b - 1)(n - 1)}{2n + \bar a + \bar b - 3} \Big\}^{1/2}. \tag{2.13} $$

For the calculation of the eigenvalues of the matrix $D$ we consider the determinant $\det(x I_n - D)$;
an expansion with respect to the last row shows that

$$ \det(x I_n - D) = (x - d_n) G_{n-1}(x) - c_{n-1}^2 G_{n-2}(x), \tag{2.14} $$

where the polynomials $G_0(x), \dots, G_{n-1}(x)$ are defined recursively by the three-term recurrence
relation

$$ G_k(x) = (x - d_k) G_{k-1}(x) - c_{k-1}^2 G_{k-2}(x) \tag{2.15} $$

($k = 1, \dots, n-1$; $G_0(x) := 1$, $G_{-1}(x) := 0$). Now a straightforward calculation and a
comparison with the three-term recurrence relation for the monic Jacobi polynomials [see e.g.
Chihara (1978), p. 220] yields

$$ G_k(x) = 2^k \hat P_k^{(\bar a, \bar b)}(x/2), \quad k = 0, \dots, n-1, \tag{2.16} $$

where $\hat P_k^{(\bar a, \bar b)}(x)$ denotes the $k$th monic Jacobi polynomial, i.e.

$$ \hat P_k^{(\bar a, \bar b)}(x) = \frac{2^k k!}{(k + \bar a + \bar b + 1)_k} P_k^{(\bar a, \bar b)}(x). \tag{2.17} $$
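Combining equations (2.15), (2.16) and (2.17) we obtain by a tedious but straightforward
calculation

$$ \begin{aligned} \det(x I_n - D) &= G_n(x) + (s_n - d_n) G_{n-1}(x) + (r_{n-1}^2 - c_{n-1}^2) G_{n-2}(x) \\ &= \frac{4^n n!}{(n + \bar a + \bar b + 1)_n} \Big\{ P_n^{(\bar a, \bar b)}(\tfrac{x}{2}) + \frac{(\bar a - \bar b)(2n + \bar a + \bar b - 1)}{(2n + \bar a + \bar b - 2)(n + \bar a + \bar b)} P_{n-1}^{(\bar a, \bar b)}(\tfrac{x}{2}) \\ &\qquad - \frac{(2n + \bar a + \bar b)(n + \bar a - 1)(n + \bar b - 1)}{(2n + \bar a + \bar b - 2)(n + \bar a + \bar b)(n + \bar a + \bar b - 1)} P_{n-2}^{(\bar a, \bar b)}(\tfrac{x}{2}) \Big\} \\ &= \frac{4^n n!}{(n + \bar a + \bar b + 1)_n} \Big\{ P_n^{(\bar a, \bar b)}(\tfrac{x}{2}) - \frac{n + \bar b}{n + \bar a + \bar b} P_{n-1}^{(\bar a, \bar b)}(\tfrac{x}{2}) \\ &\qquad + \frac{(2n + \bar a + \bar b)(n + \bar a - 1)}{(n + \bar a + \bar b)(n + \bar a + \bar b - 1)} P_{n-1}^{(\bar a - 1, \bar b)}(\tfrac{x}{2}) \Big\}. \end{aligned} $$

Here $s_n$ and $r_{n-1}$ denote the right-hand sides of (2.10) and (2.11) for $k = n-1$ and $k = n-2$,
respectively, $G_n$ is obtained from the recurrence (2.15) for $k = n$ with these coefficients, and
we have used the identity

$$ (n + \bar b - 1) P_{n-2}^{(\bar a, \bar b)}(x) = (n + \bar a + \bar b - 1) P_{n-1}^{(\bar a, \bar b)}(x) - (2n + \bar a + \bar b - 2) P_{n-1}^{(\bar a - 1, \bar b)}(x) \tag{2.18} $$

in the last step [see Abramowitz and Stegun (1965), equation (22.7.18)]. A further application
of this identity to the second polynomial yields

$$ \begin{aligned} \det(x I_n - D) &= \frac{4^n n!}{(n + \bar a + \bar b + 1)_n} \Big\{ \frac{2n + \bar a + \bar b}{n + \bar a + \bar b} P_n^{(\bar a - 1, \bar b)}(\tfrac{x}{2}) + \frac{(2n + \bar a + \bar b)(n + \bar a - 1)}{(n + \bar a + \bar b)(n + \bar a + \bar b - 1)} P_{n-1}^{(\bar a - 1, \bar b)}(\tfrac{x}{2}) \Big\} \\ &= \frac{4^n n!}{(n + \bar a + \bar b + 1)_n} \Big\{ \frac{2n + \bar a + \bar b}{n + \bar a + \bar b} P_n^{(\bar a - 1, \bar b)}(\tfrac{x}{2}) - \frac{2n + \bar a + \bar b}{n + \bar a + \bar b} P_n^{(\bar a - 1, \bar b)}(\tfrac{x}{2}) \\ &\qquad + \frac{(2n + \bar a + \bar b)(2n + \bar a + \bar b - 1)}{(n + \bar a + \bar b)(n + \bar a + \bar b - 1)} P_n^{(\bar a - 1, \bar b - 1)}(\tfrac{x}{2}) \Big\} \\ &= \frac{4^n n!}{(n + (\bar a - 1) + (\bar b - 1) + 1)_n} P_n^{(\bar a - 1, \bar b - 1)}(\tfrac{x}{2}), \end{aligned} $$

where we have used the identity

$$ (n + \bar a - 1) P_{n-1}^{(\bar a - 1, \bar b)}(x) = (2n + \bar a + \bar b - 1) P_n^{(\bar a - 1, \bar b - 1)}(x) - (n + \bar a + \bar b - 1) P_n^{(\bar a - 1, \bar b)}(x) \tag{2.19} $$

for the second equality [see Abramowitz and Stegun (1965), equation (22.7.19)]. Consequently,
the eigenvalues of the matrix $D$ are given by the roots $x_1^{(n)} < \dots < x_n^{(n)}$ of the Jacobi polynomial
$P_n^{(\bar a - 1, \bar b - 1)}(x/2)$. A similar argument as in Silverstein (1985a) now shows that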



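$$ \max_{1 \le j \le n} |\lambda_j^{(n)} - x_j^{(n)}| \le \max_{1 \le k \le n} \sum_{j=1}^{n} |J'_{kj} - D_{kj}| \le 4\{3X_n\}^{1/2} + 6X_n, \tag{2.20} $$

where the elements of the matrices $J'$ and $D$ are denoted by $J'_{kj}$ and $D_{kj}$, respectively, and the
random variable $X_n$ is defined by

$$ X_n := \max_{0 \le k \le 2n-2} |\alpha_k - E[\alpha_k]|. \tag{2.21} $$

This implies for the probability in Theorem 2.1

$$ P\Big( \max_{1 \le j \le n} |\lambda_j^{(n)} - x_j^{(n)}| > \varepsilon \Big) \le P\Big( 2\{3X_n\}^{1/2} + 3X_n > \frac{\varepsilon}{2} \Big) \le P\Big( 3\{3X_n\}^{1/2} > \frac{\varepsilon}{2} \Big) \le \sum_{k=0}^{2n-2} P\Big( |\alpha_k - E[\alpha_k]| > \frac{\varepsilon^2}{108} \Big) \tag{2.22} $$

whenever $\varepsilon \in (0,1]$. Observing Lemma A.1 in the Appendix and that $\alpha_k = 1 - 2\beta_k$, where $\beta_k$ is
the corresponding Beta-distributed random variable on the interval $[0,1]$, it follows that

$$ P\Big( \max_{1 \le j \le n} |\lambda_j^{(n)} - x_j^{(n)}| > \varepsilon \Big) \le 4 \sum_{k=0}^{2n-2} \exp\Big( c \Big( \frac{2n-k-2}{2}\beta + a + b + 2 \Big) \Big) \le 4(2n-1) \exp\big( c\,(a + b + 2) \big), $$

where the constant $c = c(\varepsilon)$ is given by

$$ c = \log\Big(1 + \frac{\varepsilon^2}{648 + 2\varepsilon^2}\Big) - \frac{\varepsilon^2}{648 + 2\varepsilon^2}. \tag{2.23} $$

This proves the assertion of the theorem. □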
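The central algebraic step of the proof, namely that $\det(x I_n - D)$ equals $4^n n!/(n + \bar a + \bar b - 1)_n \cdot P_n^{(\bar a - 1, \bar b - 1)}(x/2)$, can be checked numerically for small $n$. The following sketch builds the entries of $D$ directly from the expectations $E[\alpha_k] = (q-p)/(p+q)$, evaluates the characteristic polynomial through the three-term recurrence, and compares it with the explicit sum representation of the Jacobi polynomial. All function names are hypothetical; this is a plausibility check under the stated definitions, not part of the original argument.

```python
import math

def expected_alpha(k, n, a, b, beta):
    """E[alpha_k] = (q - p)/(p + q) for the parameters in (2.2); alpha_{-1} = alpha_{-2} = -1."""
    if k < 0:
        return -1.0
    if k % 2 == 0:
        p = (2 * n - k - 2) / 4 * beta + a + 1
        q = (2 * n - k - 2) / 4 * beta + b + 1
    else:
        p = (2 * n - k - 3) / 4 * beta + a + b + 2
        q = (2 * n - k - 1) / 4 * beta
    return (q - p) / (p + q)

def d_entries(n, a, b, beta):
    """Diagonal d_1..d_n and squared off-diagonal c_1^2..c_{n-1}^2 of the matrix D in (2.9)."""
    def e(k):
        return expected_alpha(k, n, a, b, beta)
    diag = [(1 - e(2 * n - 2 * k - 3)) * e(2 * n - 2 * k - 2)
            - (1 + e(2 * n - 2 * k - 3)) * e(2 * n - 2 * k - 4) for k in range(n)]
    off_sq = [(1 - e(2 * n - 2 * k - 5)) * (1 - e(2 * n - 2 * k - 4) ** 2)
              * (1 + e(2 * n - 2 * k - 3)) for k in range(n - 1)]
    return diag, off_sq

def char_poly(x, diag, off_sq):
    """det(x I - D) for a symmetric tridiagonal D via the three-term recurrence."""
    g_prev, g = 0.0, 1.0
    for k, d in enumerate(diag):
        c2 = off_sq[k - 1] if k > 0 else 0.0
        g_prev, g = g, (x - d) * g - c2 * g_prev
    return g

def jacobi(n, al, be, y):
    """P_n^{(al, be)}(y) from the explicit sum representation (al, be > -1)."""
    def binom(top, k):
        return math.gamma(top + 1) / (math.gamma(k + 1) * math.gamma(top - k + 1))
    return sum(binom(n + al, n - s) * binom(n + be, s)
               * ((y - 1) / 2) ** s * ((y + 1) / 2) ** (n - s) for s in range(n + 1))

def scaled_jacobi(x, n, a, b, beta):
    """4^n n! / (n + abar + bbar - 1)_n * P_n^{(abar-1, bbar-1)}(x/2)."""
    abar, bbar = (2 * a + 2) / beta, (2 * b + 2) / beta
    poch = math.gamma(2 * n + abar + bbar - 1) / math.gamma(n + abar + bbar - 1)
    return 4 ** n * math.factorial(n) / poch * jacobi(n, abar - 1, bbar - 1, x / 2)

if __name__ == "__main__":
    diag, off_sq = d_entries(5, 1.0, 2.0, 2.0)
    for x in (-1.5, 0.3, 1.9):
        print(x, char_poly(x, diag, off_sq), scaled_jacobi(x, 5, 1.0, 2.0, 2.0))
```

With $a = 1$, $b = 2$, $\beta = 2$ one has $\bar a = 2$, $\bar b = 3$, and the first diagonal entry agrees with (2.10) for $k = 0$, i.e. $d_1 = 2(9 - 4)/(5 \cdot 7) = 2/7$.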

Note that the constant $c$ in (2.23) is negative, and Theorem 2.1 therefore indicates that the
random eigenvalues of the Jacobi ensemble can be approximated by the deterministic roots of
the Jacobi polynomial $P_n^{(\bar a - 1, \bar b - 1)}(x/2)$ with high probability if $a + b$ is large. The following
result makes this statement more precise and provides a strong uniform approximation of the
eigenvalues of the Jacobi ensemble by the roots of the Jacobi polynomial $P_n^{(\bar a - 1, \bar b - 1)}(x/2)$, if the
parameters in (1.1) converge sufficiently fast to infinity. The proof follows by similar arguments
as the proof of Theorem 2.2 in Dette and Imhof (2007) and is therefore omitted.

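Theorem 2.2. Let $\lambda_1^{(n)} \le \dots \le \lambda_n^{(n)}$ denote the ordered random eigenvalues with density given
by the Jacobi ensemble (1.1) with parameters $a = a_n$, $b = b_n$, $\beta = \beta_n$ and $x_1^{(n)} < \dots < x_n^{(n)}$ denote
the ordered roots of the Jacobi polynomial $P_n^{(\bar a_n - 1, \bar b_n - 1)}(x/2)$, where

$$ \bar a_n = \frac{2a_n + 2}{\beta_n}, \qquad \bar b_n = \frac{2b_n + 2}{\beta_n}. \tag{2.24} $$

If

$$ \lim_{n \to \infty} \frac{a_n + b_n}{\log n} = \infty, \tag{2.25} $$

then the inequality

$$ \max_{1 \le j \le n} |\lambda_j^{(n)} - x_j^{(n)}| \le \Big[ \frac{\log n}{a_n + b_n} \Big]^{1/4} S $$

holds for all $n \ge 2$, where $S$ denotes an a.s. finite random variable. In particular, if

$$ \liminf_{n \to \infty} \frac{a_n + b_n}{n} > 0, \tag{2.26} $$

then there exists an a.s. finite random variable $S'$ such that the inequality

$$ \max_{1 \le j \le n} |\lambda_j^{(n)} - x_j^{(n)}| \le \Big[ \frac{\log n}{n} \Big]^{1/4} S' $$

holds a.s. for all $n \ge 2$.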

3 Asymptotic spectral properties 

In this section we will apply Theorem 2.2 to derive the asymptotic properties of the empirical
spectral distribution

$$ F_n^J(\xi) := \frac{1}{n} \sum_{i=1}^{n} I\{\lambda_i^{(n)} \le \xi\}, \tag{3.1} $$

where $\lambda_1^{(n)} \le \dots \le \lambda_n^{(n)}$ denote the ordered eigenvalues of the Jacobi ensemble defined by (1.1)
with parameters $a = a_n$, $b = b_n$ and $\beta = \beta_n$. The results of Section 2 indicate that the empirical
distribution function in (3.1) should exhibit a similar asymptotic behaviour as the empirical
distribution function

$$ F_n^P(\xi) := \frac{1}{n} \sum_{i=1}^{n} I\{x_i^{(n)} \le \xi\} \tag{3.2} $$

of the ordered roots $x_1^{(n)} < \dots < x_n^{(n)}$ of the Jacobi polynomial $P_n^{(\bar a_n - 1, \bar b_n - 1)}(x/2)$, where $\bar a_n =
(2a_n + 2)/\beta_n$, $\bar b_n = (2b_n + 2)/\beta_n$. The asymptotic zero distribution of Jacobi polynomials has
been studied by several authors [see e.g. Gawronski and Shawyer (1991), Elbert, Laforgia and
Rodono (1994), Dette and Studden (1995) or Kuijlaars and Van Assche (1999) among many
others], and we can use these results and Theorem 2.2 to derive the asymptotic properties of the
spectrum of the Jacobi ensemble. The following result makes this statement more precise.

Theorem 3.1. Consider the empirical distribution functions of the eigenvalues of the Jacobi
ensemble (1.1) and the roots of the Jacobi polynomial $P_n^{(\bar a_n - 1, \bar b_n - 1)}(x/2)$ defined by (3.1) and
(3.2), respectively, and let $(\delta_n)_{n \in \mathbb{N}}$, $(\varepsilon_n)_{n \in \mathbb{N}}$ denote real sequences with $\delta_n > 0$ such that the limit

$$ \lim_{n \to \infty} F_n^P(\delta_n \xi + \varepsilon_n) = F(\xi) $$

exists at every continuity point $\xi$ of $F$. If the conditions

$$ \frac{a_n + b_n}{\log n} \xrightarrow[n \to \infty]{} \infty, \qquad \frac{\delta_n^4 (a_n + b_n)}{\log n} \xrightarrow[n \to \infty]{} \infty \tag{3.3} $$

are satisfied, then

$$ \lim_{n \to \infty} F_n^J(\delta_n \xi + \varepsilon_n) = F(\xi) $$

almost surely at every continuity point $\xi$ of $F$.

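Proof. Let $G_n^P$ and $G_n^J$ denote the empirical distribution functions of the rescaled roots
$(x_j^{(n)} - \varepsilon_n)/\delta_n$ ($j = 1, \dots, n$) corresponding to the Jacobi polynomial $P_n^{(\bar a_n - 1, \bar b_n - 1)}(x/2)$ and of the
rescaled eigenvalues $(\lambda_j^{(n)} - \varepsilon_n)/\delta_n$ ($j = 1, \dots, n$) corresponding to the Jacobi ensemble defined by
(1.1), respectively. The arguments presented in the proof of Theorem 2.1 show that $G_n^P$ and $G_n^J$
are the empirical distribution functions of the eigenvalues of the matrices

$$ A_n := \frac{1}{\delta_n}(D - \varepsilon_n I_n), \qquad B_n := \frac{1}{\delta_n}(J - \varepsilon_n I_n), $$

respectively. Observing Lemma 2.3 in Bai (1999) we obtain for the Lévy distance between the
distribution functions $G_n^P$ and $G_n^J$

$$ L^3(G_n^P, G_n^J) \le \frac{1}{n} \sum_{i=1}^{n} \Big| \frac{\lambda_i^{(n)} - x_i^{(n)}}{\delta_n} \Big|^2 \le \frac{1}{\delta_n^2} \max_{1 \le i \le n} |\lambda_i^{(n)} - x_i^{(n)}|^2 \le \frac{S^2}{\delta_n^2} \Big( \frac{\log n}{a_n + b_n} \Big)^{1/2}, $$

where $S$ denotes an a.s. finite random variable. Consequently, we obtain from the assumptions

$$ L(G_n^P, G_n^J) \xrightarrow[n \to \infty]{} 0, $$

and the assertion of Theorem 3.1 follows observing the identities

$$ G_n^P(\xi) = F_n^P(\delta_n \xi + \varepsilon_n), \qquad G_n^J(\xi) = F_n^J(\delta_n \xi + \varepsilon_n). $$

□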
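Theorem 3.2. Let $\lambda_1^{(n)} \le \dots \le \lambda_n^{(n)}$ denote the ordered random eigenvalues with density given
by the Jacobi ensemble (1.1). If the assumptions of Theorem 3.1 are satisfied and there exist
constants $a_1, a_2 \in \mathbb{R}$, $b_1, b_2 \in \mathbb{R}^+$ such that

$$ \text{(i)} \quad \lim_{n \to \infty} \frac{1}{\delta_n} \Big( \frac{n + \bar b_n - 1}{2n + \bar a_n + \bar b_n - 2} - \varepsilon_n \Big) = \frac{a_1}{2}, $$

$$ \text{(ii)} \quad \lim_{n \to \infty} \frac{1}{\delta_n} \Big( \frac{n(n + \bar a_n - 1) + (n + \bar b_n - 1)(n + \bar a_n + \bar b_n - 2)}{(2n + \bar a_n + \bar b_n - 2)^2} - \varepsilon_n \Big) = \frac{a_2}{2}, $$

$$ \text{(iii)} \quad \lim_{n \to \infty} \frac{1}{\delta_n^2} \frac{(n + \bar b_n - 1)(n + \bar a_n - 1)\, n}{(2n + \bar a_n + \bar b_n - 2)^3} = \frac{b_1}{4}, $$

$$ \text{(iv)} \quad \lim_{n \to \infty} \frac{1}{\delta_n^2} \frac{(n + \bar b_n - 1)(n + \bar a_n - 1)(n + \bar a_n + \bar b_n - 2)\, n}{(2n + \bar a_n + \bar b_n - 2)^4} = \frac{b_2}{4}, $$

then the empirical distribution of the scaled eigenvalues

$$ \frac{\lambda_1^{(n)} - 2(2\varepsilon_n - 1)}{2\delta_n}, \ \dots, \ \frac{\lambda_n^{(n)} - 2(2\varepsilon_n - 1)}{2\delta_n} $$

converges almost surely to a non-degenerate distribution function, i.e.

$$ \lim_{n \to \infty} F_n^J\big(2\delta_n \xi + 2(2\varepsilon_n - 1)\big) = \int_{-\infty}^{\xi} f^{(a_1,a_2,b_1,b_2)}(x)\, dx, $$

where

$$ f^{(a_1,a_2,b_1,b_2)}(x) = \begin{cases} \dfrac{b_1}{2\pi} \dfrac{\sqrt{4b_2 - (x - a_2)^2}}{(b_2 - b_1)x^2 + (b_1 a_2 + b_1 a_1 - 2 b_2 a_1)x + b_2 a_1^2 - a_1 a_2 b_1 + b_1^2} & \text{if } |x - a_2| < 2\sqrt{b_2}, \\ 0 & \text{else.} \end{cases} $$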



Proof. By Theorem 2.2 in Dette and Studden (1995) it follows that the empirical distribution
of the roots of the Jacobi polynomial $P_n^{(\bar a_n - 1, \bar b_n - 1)}(x/2)$ has a non-degenerate limit, that is

$$ F_n^P\big(2\delta_n \xi + 2(2\varepsilon_n - 1)\big) \xrightarrow[n \to \infty]{} \int_{-\infty}^{\xi} f^{(a_1,a_2,b_1,b_2)}(x)\, dx. $$

The assertion is now an immediate consequence of Theorem 3.1. □



Example 3.3. Assume that

$$ \lim_{n \to \infty} \frac{\bar a_n}{n} = \alpha_0 \quad \text{and} \quad \lim_{n \to \infty} \frac{\bar b_n}{n} = \beta_0 $$

for some constants $\alpha_0, \beta_0 \ge 0$. If additionally $(a_n + b_n)/\log n \to \infty$, it is easy to see that the
assumptions of Theorem 3.2 are satisfied with $\delta_n = \frac{1}{2}$, $\varepsilon_n = \frac{1}{2}$ ($n \in \mathbb{N}$). Consequently, it follows
that the empirical spectral distribution of the Jacobi ensemble (1.1) converges almost surely to
a distribution function with density

$$ f_{\alpha_0, \beta_0}(x) = \frac{2 + \alpha_0 + \beta_0}{2\pi} \frac{\sqrt{(2r_2 - x)(x - 2r_1)}}{4 - x^2} I_{(2r_1, 2r_2)}(x), $$

where

$$ r_1 := \frac{\beta_0^2 - \alpha_0^2 - 4\sqrt{(\alpha_0 + 1)(\beta_0 + 1)(\alpha_0 + \beta_0 + 1)}}{(2 + \alpha_0 + \beta_0)^2}, $$

$$ r_2 := \frac{\beta_0^2 - \alpha_0^2 + 4\sqrt{(\alpha_0 + 1)(\beta_0 + 1)(\alpha_0 + \beta_0 + 1)}}{(2 + \alpha_0 + \beta_0)^2}. $$

For example, if $a_n = b_n = 3n$ and $\beta_n = 2$ we have

$$ \lim_{n \to \infty} \frac{\bar a_n}{n} = \lim_{n \to \infty} \frac{\bar b_n}{n} = 3 \tag{3.4} $$

and the limiting spectral distribution has the density

$$ f_{3,3}(x) = \frac{2}{\pi} \frac{\sqrt{7 - 4x^2}}{4 - x^2} I_{(-\sqrt{7}/2, \sqrt{7}/2)}(x), $$

which is depicted in the left part of Figure 1. Similarly, if $a_n = b_n = \sqrt{n}$ and $\beta_n = 2n$ we have

$$ \lim_{n \to \infty} \frac{\bar a_n}{n} = \lim_{n \to \infty} \frac{\bar b_n}{n} = 0, $$

and the limiting distribution is given by the arc-sine law on the interval $[-2, 2]$ with density

$$ f_{0,0}(x) = \frac{1}{\pi \sqrt{4 - x^2}} I_{(-2,2)}(x), $$

displayed in the right part of Figure 1.
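The limiting density of Example 3.3 is easy to evaluate numerically. The sketch below (hypothetical function names, plain midpoint rule) computes the support endpoints $2r_1, 2r_2$ and checks for $\alpha_0 = \beta_0 = 3$ that the density integrates to one; it is meant as a plausibility check of the formulas above, not as part of the original example.

```python
import math

def support_endpoints(alpha0, beta0):
    """Return (r1, r2) as defined in Example 3.3; the support is (2*r1, 2*r2)."""
    root = 4 * math.sqrt((alpha0 + 1) * (beta0 + 1) * (alpha0 + beta0 + 1))
    denom = (2 + alpha0 + beta0) ** 2
    return ((beta0 ** 2 - alpha0 ** 2 - root) / denom,
            (beta0 ** 2 - alpha0 ** 2 + root) / denom)

def limit_density(x, alpha0, beta0):
    """Density f_{alpha0, beta0}(x) of the limiting spectral distribution."""
    r1, r2 = support_endpoints(alpha0, beta0)
    if not (2 * r1 < x < 2 * r2):
        return 0.0
    return ((2 + alpha0 + beta0) / (2 * math.pi)
            * math.sqrt((2 * r2 - x) * (x - 2 * r1)) / (4 - x * x))

def total_mass(alpha0, beta0, steps=20000):
    """Midpoint-rule approximation of the integral of the density over its support."""
    r1, r2 = support_endpoints(alpha0, beta0)
    lo, hi = 2 * r1, 2 * r2
    h = (hi - lo) / steps
    return h * sum(limit_density(lo + (i + 0.5) * h, alpha0, beta0) for i in range(steps))

if __name__ == "__main__":
    print(support_endpoints(3.0, 3.0))
    print(total_mass(3.0, 3.0))
    print(limit_density(0.0, 0.0, 0.0), 1 / (2 * math.pi))
```

For $\alpha_0 = \beta_0 = 0$ the endpoints are $\pm 2$ and the density reduces to the arc-sine law, so its value at the origin is $1/(2\pi)$.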

Figure 1: Density of the limiting distribution of the random eigenvalues corresponding to the
Jacobi ensemble and a histogram based on $n = 5000$ eigenvalues from the Jacobi ensemble (1.1).
Left panel: $a_n = b_n = 3n$, $\beta_n = 2$; right panel: $a_n = b_n = \sqrt{n}$, $\beta_n = 2n$.

Example 3.4. If the parameters $\bar a_n$ and $\bar b_n$ converge to infinity such that

$$ \lim_{n \to \infty} \frac{\bar a_n}{n} = \infty, \quad \lim_{n \to \infty} \frac{\bar b_n}{n} = \infty, \quad \lim_{n \to \infty} \frac{\bar a_n}{\bar b_n} = \gamma > 0, $$

and additionally

$$ \beta_n \sim C, \qquad \bar a_n = O(n^{1+\nu}), \tag{3.5} $$

for some $C \in \mathbb{R}^+$, $\nu \in (0,1)$, it follows that the scaled empirical distribution of the eigenvalues
of the Jacobi ensemble (1.1) with parameters $a_n$, $b_n$ and $\beta_n$ converges almost surely to Wigner's
semicircle law, that is

$$ \lim_{n \to \infty} F_n^J\Big( 2\sqrt{\frac{n}{\bar a_n - 1}}\, \xi - 2\, \frac{\bar a_n - \bar b_n}{\bar a_n + \bar b_n - 2} \Big) = \frac{2}{\pi \sigma^2} \int_{-\sigma}^{\xi} \sqrt{\sigma^2 - x^2}\, dx, \qquad |\xi| \le \sigma, $$

where $\sigma = 4\gamma/(1 + \gamma)^{3/2}$. Note that assumption (3.5) guarantees that the second condition in (3.3)
of Theorem 3.1 is satisfied, i.e.

$$ \lim_{n \to \infty} \Big( \sqrt{\frac{n}{\bar a_n - 1}} \Big)^{4} \frac{a_n + b_n}{\log n} = \infty. $$

This situation is also of particular interest in the case where the inverse temperature $\beta_n$ converges
to 0. For example, if $a_n = b_n = n - 1$ and $\beta_n = 2n^{-1/4}$ we have $\bar a_n = \bar b_n = n^{5/4}$ and the condition
(3.3) is satisfied. The limiting distribution of the scaled eigenvalues is then given by

$$ f(x) = \frac{1}{\pi} \sqrt{2 - x^2}\, I_{[-\sqrt{2}, \sqrt{2}]}(x). $$
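The last special case can be checked with a few lines of code: for $a_n = b_n = n - 1$ and $\beta_n = 2n^{-1/4}$ the definition (2.24) gives $\bar a_n = n^{5/4}$, and for $\gamma = 1$ the radius $\sigma = 4\gamma/(1+\gamma)^{3/2}$ equals $\sqrt{2}$, so the semicircle density becomes $\frac{1}{\pi}\sqrt{2 - x^2}$. The function names below are hypothetical and the snippet is only a sketch of this arithmetic.

```python
import math

def bar_parameter(a, beta):
    """Transformed parameter (2a + 2)/beta from (2.24)."""
    return (2 * a + 2) / beta

def sigma(gamma):
    """Radius of the semicircle in Example 3.4 as stated there."""
    return 4 * gamma / (1 + gamma) ** 1.5

def semicircle_density(x, s):
    """Semicircle density 2/(pi s^2) * sqrt(s^2 - x^2) on (-s, s)."""
    if abs(x) >= s:
        return 0.0
    return 2 / (math.pi * s * s) * math.sqrt(s * s - x * x)

def mass(s, steps=20000):
    """Midpoint-rule approximation of the total mass of the semicircle density."""
    h = 2 * s / steps
    return h * sum(semicircle_density(-s + (i + 0.5) * h, s) for i in range(steps))

if __name__ == "__main__":
    n = 10000
    print(bar_parameter(n - 1, 2 * n ** -0.25), n ** 1.25)
    print(sigma(1.0), math.sqrt(2))
    print(mass(sigma(1.0)))
```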



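Example 3.5. We now consider the case where the sequences $\bar a_n$ and $\bar b_n$ converge to infinity
with different rates. If

$$ \lim_{n \to \infty} \frac{\bar a_n}{n} = \infty, \qquad \lim_{n \to \infty} \frac{\bar b_n}{n} = \beta_0 > 0, $$

and the sequences $(\beta_n)_{n \in \mathbb{N}}$, $(\bar a_n)_{n \in \mathbb{N}}$ satisfy

$$ \beta_n \sim C, \qquad \bar a_n = O(n^{1+\nu}) $$

for constants $C \in \mathbb{R}^+$ and $\nu \in (0, 1/3)$, then it is easy to see that the conditions of Theorem 3.2
are satisfied with

$$ \delta_n = \frac{n}{\bar a_n - 1}, \qquad \varepsilon_n = 0, $$

and it follows by a straightforward calculation that

$$ \lim_{n \to \infty} F_n^J\Big( \frac{2n}{\bar a_n - 1}\, \xi - 2 \Big) = \frac{1}{4\pi} \int_{s_1}^{\xi} \frac{\sqrt{(s_2 - x)(x - s_1)}}{x}\, dx, \qquad s_1 \le \xi \le s_2, $$

where

$$ s_1 := 2(2 + \beta_0) - 4\sqrt{1 + \beta_0}, \qquad s_2 := 2(2 + \beta_0) + 4\sqrt{1 + \beta_0}. $$

Similarly, if

$$ \lim_{n \to \infty} \frac{\bar a_n}{n} = \infty, \quad \lim_{n \to \infty} \frac{\bar b_n}{n} = \infty, \quad \lim_{n \to \infty} \frac{\bar a_n}{\bar b_n} = \infty, $$

and for some $\nu \in (0,1)$ there exist constants $C, C_1, C_2 > 0$ and a $\mu > \frac{3}{2}\nu - \frac{1}{2}$ such that

$$ \beta_n \sim C, \qquad \bar a_n \le C_1 n^{1+\nu}, \qquad \bar b_n \ge C_2 n^{1+\mu}, \tag{3.6} $$

then we obtain for any $-2 < \xi < 6$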



, ;n(b n — 1) a n + 2\ n(b n — 1) — b n \ 1 K , 



2 



4 An application to the multivariate F distribution 

Let $X_{ij}$ ($i = 1, \dots, n$; $j = 1, \dots, n_1$; $n_1 > n$) and $Y_{ij}$ ($i = 1, \dots, n$; $j = 1, \dots, n_2$; $n_2 > n$)
denote independent standard normal distributed random variables and consider the random
matrices

$$ X_n = (X_{ij})_{i=1,\dots,n}^{j=1,\dots,n_1} \in \mathbb{R}^{n \times n_1}, \qquad Y_n = (Y_{ij})_{i=1,\dots,n}^{j=1,\dots,n_2} \in \mathbb{R}^{n \times n_2}. \tag{4.1} $$

The matrix

$$ F_n := \Big( \frac{1}{n_1} X_n X_n^T \Big) \Big( \frac{1}{n_2} Y_n Y_n^T \Big)^{-1} \in \mathbb{R}^{n \times n} \tag{4.2} $$

is called multivariate $F$-matrix and plays a prominent role in the multivariate analysis of variance
[see e.g. Muirhead (1982)]. Silverstein (1985b) showed that under the conditions

$$ \lim_{n \to \infty} \frac{n}{n_1} = y > 0, \qquad \lim_{n \to \infty} \frac{n}{n_2} = y' \in (0,1) $$

the empirical distribution of the eigenvalues of a multivariate $F$-matrix converges in probability
to a non-random distribution function, say $F_{y,y'}$, with density

$$ f_{y,y'}(x) = \frac{(1 - y') \sqrt{(x - s_1)(s_2 - x)}}{2\pi x (y + y' x)} I_{(s_1, s_2)}(x), \tag{4.3} $$

where the bounds of the support are given by

$$ s_1 = \Big( \frac{1 - \sqrt{1 - (1 - y)(1 - y')}}{1 - y'} \Big)^2, \qquad s_2 = \Big( \frac{1 + \sqrt{1 - (1 - y)(1 - y')}}{1 - y'} \Big)^2. $$

Moreover, if $y > 1$ the limiting distribution has mass $1 - 1/y$ at the point 0. A corresponding
result for the expectation of the spectral distribution of the matrix $F_n$ can be found in Collins
(2005). In the following discussion we will extend these results in two different directions using
the methodology developed in Section 2. On the one hand, we prove that in the case $y \in (0,1]$
these results are also correct if almost sure convergence is considered. On the other hand, we
extend these results to the case where $n$, $n_1$, $n_2$ are not necessarily of the same order.
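The density (4.3) and its support can be evaluated numerically as a quick sanity check. The sketch below uses hypothetical function names and a plain midpoint rule; for $y = y' = 1/2$ the total mass should be close to one, since no point mass at 0 occurs for $y \le 1$.

```python
import math

def f_support(y, yp):
    """Support bounds (s1, s2) of the limiting F-matrix density for y' = yp in (0, 1)."""
    root = math.sqrt(1 - (1 - y) * (1 - yp))
    return ((1 - root) / (1 - yp)) ** 2, ((1 + root) / (1 - yp)) ** 2

def f_density(x, y, yp):
    """Density (4.3) of the limiting spectral distribution of the F-matrix."""
    s1, s2 = f_support(y, yp)
    if not (s1 < x < s2):
        return 0.0
    return (1 - yp) * math.sqrt((x - s1) * (s2 - x)) / (2 * math.pi * x * (y + yp * x))

def f_mass(y, yp, steps=100000):
    """Midpoint-rule approximation of the integral of (4.3) over (s1, s2)."""
    s1, s2 = f_support(y, yp)
    h = (s2 - s1) / steps
    return h * sum(f_density(s1 + (i + 0.5) * h, y, yp) for i in range(steps))

if __name__ == "__main__":
    print(f_support(0.5, 0.5))
    print(f_mass(0.5, 0.5))
```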

Theorem 4.1. Consider the multivariate $F$-matrix defined by (4.2). If

$$ \lim_{n \to \infty} \frac{n}{n_1} = y \in (0,1], \qquad \lim_{n \to \infty} \frac{n}{n_2} = y' \in (0,1), \tag{4.4} $$

then the empirical distribution function of the eigenvalues of the matrix $F_n$ converges a.s. to
a distribution function with density (4.3).

Proof. Consider the matrix

$$ A_n := 2 (Y_n Y_n^T - X_n X_n^T)(Y_n Y_n^T + X_n X_n^T)^{-1}, \tag{4.5} $$

then it follows from Muirhead (1982) that the joint density of the eigenvalues of $A_n$ is given by
the Jacobi ensemble (1.1) with $\beta = 1$ and

$$ a_n = \tfrac{1}{2}(n_1 - n - 1), \qquad b_n = \tfrac{1}{2}(n_2 - n - 1). $$

If $\lambda^F$ denotes an eigenvalue of the matrix $F_n$ we obtain with some appropriate constant $C \in \mathbb{R}$
the identity

$$ \det\Big( 2\, \frac{1 - \frac{n_1}{n_2}\lambda^F}{1 + \frac{n_1}{n_2}\lambda^F}\, I_n - 2 (Y_n Y_n^T - X_n X_n^T)(Y_n Y_n^T + X_n X_n^T)^{-1} \Big) = \det\Big( \lambda^F I_n - \frac{n_2}{n_1} (X_n X_n^T)(Y_n Y_n^T)^{-1} \Big) \cdot \det(Y_n Y_n^T) \cdot C = 0, $$

which shows that

$$ \lambda^A = 2\, \frac{1 - \frac{n_1}{n_2}\lambda^F}{1 + \frac{n_1}{n_2}\lambda^F} \tag{4.6} $$

is an eigenvalue of the matrix $A_n$. Consequently, the empirical distribution function $F_n^F$ of the
eigenvalues of the matrix $F_n$ satisfies the relation

$$ F_n^F(\xi) = 1 - F_n^J\Big( 2\, \frac{1 - \frac{n_1}{n_2}\xi}{1 + \frac{n_1}{n_2}\xi} \Big), \qquad \forall\, \xi \ge 0, \tag{4.7} $$

where $F_n^J$ denotes the empirical distribution function corresponding to the Jacobi ensemble (1.1)
with parameter $\beta = 1$ (note that only the case $\xi \ge 0$ is of interest here). We now use Theorem
3.2 to derive the limiting spectral distribution. For this purpose we identify

$$ \bar a_n - 1 = \frac{2a_n + 2}{\beta} - 1 = n_1 - n, \qquad \bar b_n - 1 = \frac{2b_n + 2}{\beta} - 1 = n_2 - n $$

and obtain the limits

$$ \lim_{n \to \infty} \frac{\bar a_n}{n} = \lim_{n \to \infty} \frac{n_1 - n + 1}{n} = y^{-1} - 1 \ge 0, $$

$$ \lim_{n \to \infty} \frac{\bar b_n}{n} = \lim_{n \to \infty} \frac{n_2 - n + 1}{n} = y'^{-1} - 1 > 0. $$

Moreover, the assumption

$$ \lim_{n \to \infty} \frac{a_n + b_n}{\log n} = \infty $$

is obviously satisfied. From Example 3.3 it therefore follows that the empirical distribution
function of the eigenvalues of the matrix $A_n$ converges a.s. to a distribution function $F$ with
density

$$ f(x) = \frac{y^{-1} + y'^{-1}}{2\pi} \frac{\sqrt{(2r_2 - x)(x - 2r_1)}}{4 - x^2} I_{(2r_1, 2r_2)}(x), $$

where $r_1$ and $r_2$ are given by

$$ r_1 := \frac{(y - yy')^2 - (y' - yy')^2 - 4\sqrt{y^2 y'^2 (y + y' - yy')}}{(y + y')^2}, $$

$$ r_2 := \frac{(y - yy')^2 - (y' - yy')^2 + 4\sqrt{y^2 y'^2 (y + y' - yy')}}{(y + y')^2}. $$

Observing the relation (4.7) we obtain

$$ F_n^F(\xi) \xrightarrow[n \to \infty]{} 1 - F\Big( 2\, \frac{y - y'\xi}{y + y'\xi} \Big) $$

and the assertion of the theorem now follows by a straightforward but tedious calculation of the
density of the limiting distribution. □



While the preceding theorem essentially provides an alternative proof of the results of Silverstein 
(1985b), the following three theorems extend Silverstein's findings to the case where y,y' = 0. 

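Theorem 4.2. Consider the multivariate $F$-matrix defined in (4.2) and denote by $\lambda_1^F \le \dots \le \lambda_n^F$
the corresponding eigenvalues. If

$$ \lim_{n \to \infty} \frac{n}{n_1} = 0, \qquad \lim_{n \to \infty} \frac{n}{n_2} = 0, \qquad \lim_{n \to \infty} \frac{n_1}{n_2} = \gamma > 0 $$

and

$$ n_1 = O(n^{1+\nu}) $$

with $\nu \in (0,1)$, then the empirical distribution function of the transformed eigenvalues

$$ \mu_i = \sqrt{\frac{n_1 - n}{n}} \Big\{ \frac{n_1 - n_2}{n_1 + n_2 - 2n} + \frac{n_2 - n_1 \lambda_i^F}{n_1 \lambda_i^F + n_2} \Big\}, \qquad i = 1, \dots, n, $$

converges a.s. to a distribution function with density

$$ f_\gamma(x) = \frac{2}{\pi \sigma^2} \sqrt{\sigma^2 - x^2}\, I\{-\sigma < x < \sigma\}, $$

where $\sigma = 4\gamma/(1 + \gamma)^{3/2}$.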

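Proof. Recall the definition of the matrix $A_n$ in (4.5), which corresponds to the Jacobi ensemble
(1.1) with $\beta = 1$, $a_n = \frac{1}{2}(n_1 - n - 1)$, $b_n = \frac{1}{2}(n_2 - n - 1)$. Using the notation (2.24) we obtain
$\bar a_n - 1 = n_1 - n$, $\bar b_n - 1 = n_2 - n$. By the assumption of the theorem we have

$$ \lim_{n \to \infty} \frac{\bar a_n}{n} = \infty, \qquad \lim_{n \to \infty} \frac{\bar b_n}{n} = \infty, \qquad \lim_{n \to \infty} \frac{\bar a_n}{\bar b_n} = \gamma $$

and $\bar a_n = O(n^{1+\nu})$. Therefore it follows from Example 3.4 that

$$ \lim_{n \to \infty} F_n^J\Big( 2\sqrt{\frac{n}{n_1 - n}}\, \xi - 2\, \frac{n_1 - n_2}{n_1 + n_2 - 2n} \Big) = \frac{2}{\pi \sigma^2} \int_{-\infty}^{\xi} \sqrt{\sigma^2 - x^2}\, I\{-\sigma < x < \sigma\}\, dx $$

a.s., where $F_n^J$ denotes the empirical distribution of the eigenvalues of the matrix $A_n$. The
identity (4.7) implies for $\xi > -2$

$$ F_n^J(\xi) = 1 - F_n^F\Big( \frac{n_2}{n_1}\, \frac{2 - \xi}{2 + \xi} \Big), $$

and therefore it follows

$$ \lim_{n \to \infty} \Big( 1 - F_n^F\Big( \frac{n_2}{n_1} \Big( \frac{2}{\sqrt{\frac{n}{n_1 - n}}\, \xi + 2\, \frac{n_2 - n}{n_1 + n_2 - 2n}} - 1 \Big) \Big) \Big) = \frac{2}{\pi \sigma^2} \int_{-\infty}^{\xi} \sqrt{\sigma^2 - x^2}\, I\{-\sigma < x < \sigma\}\, dx $$

a.s., which proves the assertion of the theorem. □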

Theorem 4.3. Consider the multivariate $F$-matrix and denote by $\lambda_1^F \le \dots \le \lambda_n^F$ the
corresponding eigenvalues. If

$$ \lim_{n \to \infty} \frac{n}{n_1} = 0, \qquad \lim_{n \to \infty} \frac{n}{n_2} = y' \in (0,1] $$

and

$$ n_1 = O(n^{1+\nu}), $$

with $\nu \in (0, 1/3)$, then the empirical distribution function of the scaled eigenvalues

$$ \mu_i = \frac{n}{2(n_1 - n)} \Big( \lambda_i^F \frac{n_1}{n_2} + 1 \Big), \qquad i = 1, \dots, n, $$

converges a.s. to a distribution function $F$ with density

$$ f_{y'}(x) = \frac{1}{4\pi} \frac{\sqrt{(x s_2 - 1)(1 - x s_1)}}{x^2} I_{(s_2^{-1}, s_1^{-1})}(x), $$

where the bounds of the support of the density are given by

$$ s_1 := 2(y'^{-1} + 1) - 4\sqrt{y'^{-1}}, \qquad s_2 := 2(y'^{-1} + 1) + 4\sqrt{y'^{-1}}. $$

The proof is analogous to the proof of Theorem 4.2, using the first result of Example 3.5.
Similarly, the following theorem can be proven using the last statement in Example 3.5.

Theorem 4.4. Consider the multivariate $F$-matrix and denote by $\lambda_1^F \le \dots \le \lambda_n^F$ the
corresponding eigenvalues. If

$$ \lim_{n \to \infty} \frac{n}{n_1} = 0, \qquad \lim_{n \to \infty} \frac{n}{n_2} = 0, \qquad \lim_{n \to \infty} \frac{n_1}{n_2} = \infty $$

and for some $\nu \in (0,1)$ there exist constants $C_1, C_2 > 0$ and a $\mu > \frac{3}{2}\nu - \frac{1}{2}$ such that

$$ n_1 \le C_1 n^{1+\nu}, \qquad n_2 \ge C_2 n^{1+\mu}, $$

then the empirical distribution function of the scaled eigenvalues

n m - n ^(n 2 - y/n(n 2 - n))Af - (m + y/n(n 2 - nj) . 

n- = ^ ^ ^ — 

m + na v/n(n 2 -n)(l + ^Af) 

converges a.s. to a distribution function $F$ with density

$$ f(x) = \frac{1}{8\pi} \sqrt{(6 + x)(2 - x)}\, I_{[-6,2]}(x). $$



5 Appendix: auxiliary results 



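Lemma A.1. Let $Z$ denote a Beta-distributed random variable on the interval $[0,1]$ with density

$$ f(z) = \frac{\Gamma(p+q)}{\Gamma(p)\Gamma(q)} z^{p-1} (1 - z)^{q-1} I_{(0,1)}(z), \tag{5.1} $$

then for any $\delta > 0$

$$ P(|Z - E[Z]| > \delta) \le 4 e^{c(p+q)}, $$

where the constant $c$ is defined by

$$ c = \log\Big(1 + \frac{\delta}{3 + 2\delta}\Big) - \frac{\delta}{3 + 2\delta}. $$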

Proof. If $X \sim \Gamma(p, p+q)$, $Y \sim \Gamma(q, p+q)$ denote independent Gamma-distributed random
variables, it is well known that the ratio $Z = X/(X + Y)$ has a Beta distribution with density
(5.1). Because $E[Z] = E[X] = p/(p+q)$ it follows that

$$ P(|Z - E[Z]| > \delta) = P\Big( \Big| \frac{X}{X + Y} - E[X] \Big| > \delta \Big). \tag{5.2} $$

Define $\delta' = \delta/(3 + 2\delta)$ and assume that

$$ |X - E[X]| \le \delta', \qquad |Y - E[Y]| \le \delta', $$

then it is easy to see that $|X + Y - 1| = |X + Y - E[X + Y]| \le 2\delta'$, and

$$ \Big| \frac{X}{X + Y} - E[X] \Big| \le \frac{1}{1 - 2\delta'} \big( |X - E[X]| + E[X]\, |X + Y - E[X + Y]| \big) \le \frac{1}{1 - 2\delta'}\, 3\delta' = \delta. $$

This implies for the probability in (5.2)

$$ P\Big( \Big| \frac{X}{X + Y} - E[X] \Big| > \delta \Big) \le P(X > E[X] + \delta') + P(X < E[X] - \delta') + P(Y > E[Y] + \delta') + P(Y < E[Y] - \delta'). \tag{5.3} $$
Using similar arguments as in Dette and Imhof (2007) we obtain the estimates

$$ P(U > E[U] + \delta') \le \exp\{(p+q)(\log(1 + \delta') - \delta')\}, $$
$$ P(U < E[U] - \delta') \le \exp\{(p+q)(\log(1 - \delta') + \delta')\}, $$

where $U$ is either $X$ or $Y$. The assertion of Lemma A.1 now follows from (5.3) and the definition
of $\delta'$, observing that $\log(1 + \delta') - \delta' \ge \log(1 - \delta') + \delta'$. □
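The bound of Lemma A.1 can be compared with simulated tail frequencies. The following sketch uses only the standard library; the function names, the number of trials and the seed are hypothetical choices for illustration. It also makes the arithmetic behind the choice $\delta' = \delta/(3 + 2\delta)$ explicit, namely $3\delta'/(1 - 2\delta') = \delta$.

```python
import math
import random

def beta_tail_bound(p, q, delta):
    """Upper bound 4 * exp(c (p + q)) from Lemma A.1."""
    dp = delta / (3 + 2 * delta)   # delta' in the proof
    c = math.log1p(dp) - dp        # negative for delta > 0
    return 4 * math.exp(c * (p + q))

def empirical_tail(p, q, delta, trials=2000, seed=0):
    """Monte Carlo estimate of P(|Z - E[Z]| > delta) for Z ~ Beta(p, q) on [0, 1]."""
    rng = random.Random(seed)
    mean = p / (p + q)
    hits = sum(abs(rng.betavariate(p, q) - mean) > delta for _ in range(trials))
    return hits / trials

if __name__ == "__main__":
    p = q = 2000.0
    delta = 0.2
    print(beta_tail_bound(p, q, delta), empirical_tail(p, q, delta))
    dp = delta / (3 + 2 * delta)
    print(3 * dp / (1 - 2 * dp), delta)
```

For small $p + q$ the bound exceeds one and carries no information; it becomes useful only when $p + q$ is large relative to $\delta^{-2}$.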